Determining Genetic Causal Variants Through Multivariate Regression Using Mixture Model Penalty
نویسندگان
چکیده
With the availability of high-throughput sequencing data, identification of genetic causal variants accurately requires the efficient incorporation of function annotation data into the optimization routine. This motivates the need for development of novel methods for genome wide association studies with special focus on fine-mapping capabilities. A penalty function method that is simple to implement and capable of integrating functional annotation information into the estimation procedure, is proposed in this work. The idea is to use the prior distribution of the effect sizes explicitly as a penalty function. The estimates obtained are shown to be better correlated with the true effect sizes (in comparison with a few existing techniques). An increase in the positive and negative predictive value is demonstrated using Hapgen2 simulated data.
منابع مشابه
Model selection based on logistic regression in a highly correlated candidate gene region
Our aim is to develop methods for identifying a (causal) variant or variants from a dense panel of single-nucleotide polymorphisms (SNPs) that are genotyped on the evidence of previous studies. Because a large number of SNPs are in close proximity to each other, the magnitude of linkage disequilibrium (LD) plays an important role. Namely, highly correlated SNPs may hamper standard methods such ...
متن کاملDetermining Effective Factors on Forest Fire Using the Compound of Multivariate Adaptive Regression Spline and Genetic Algorithm, a Case Study: Golestan, Iran
Determining Effective Factors on Forest Fire Using the Compound of Multivariate Adaptive Regression Spline and Genetic Algorithm, a Case Study: Golestan, Iran Pahlavani, P., Assistant professor at School of Surveying and Geospatial Engineering, College of Engineering, University of Tehran Raei, A., PhD Candidate of GIS at School of Surveying and Geospatial Engineering, College of Engineeri...
متن کاملDetermining Effective Factors on Forest Fire Using the Compound of Multivariate Adaptive Regression Spline and Genetic Algorithm, a Case Study: Golestan, Iran
Determining Effective Factors on Forest Fire Using the Compound of Multivariate Adaptive Regression Spline and Genetic Algorithm, a Case Study: Golestan, Iran Pahlavani, P., Assistant professor at School of Surveying and Geospatial Engineering, College of Engineering, University of Tehran Raei, A., PhD Candidate of GIS at School of Surveying and Geospatial Engineering, College of Engineeri...
متن کاملPenalized-regression-based multimarker genotype analysis of Genetic Analysis Workshop 17 data
Testing for association between multiple markers and a phenotype can not only capture untyped causal variants in weak linkage disequilibrium with nearby typed markers but also identify the effect of a combination of markers. We propose a sliding window approach that uses multimarker genotypes as variables in a penalized regression. We investigate a penalty with three separate components: (1) a ...
متن کاملPenalized regression approaches to testing for quantitative trait-rare variant association
In statistical data analysis, penalized regression is considered an attractive approach for its ability of simultaneous variable selection and parameter estimation. Although penalized regression methods have shown many advantages in variable selection and outcome prediction over other approaches for high-dimensional data, there is a relative paucity of the literature on their applications to hy...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 9 شماره
صفحات -
تاریخ انتشار 2018